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(57) ABSTRACT 

The present invention provides a method and apparatus 
(400) for iteratively decoding data which has been encoded 
with contatenated codes. The apparatus (400) includes pipe- 
lined and cascaded decoder processors (406, 430 and 436) 
connected to a multiple block memory device (402), through 
a multiplexing and data control block (404). A data decision 
element (437) is provided for generating decoded output 
data. The method includes receiving encoded data (802) 
while data already received is processed iteratively by 
decoder processors in a pipelined fashion. Decoder proces- 
sors are designated to perform particular iterations (810) of 
an iterative decoding process which are performed simulta- 
neously. As a decoder processor completes processing its 
designated iteration on a block of data, the decoder proces- 
sor outputs decoding information (808) to the decoding 
processor designated to perform the subsequent iteration. 
Upon completion of all iterations for a block of data, the 
method includes generating output (814) consisting of the 
decoded data block. The method provides that once pro- 
cessing is complete on a data block, the memory block is 
made available (816) for the storing of new encoded input 
data. 

18 Claims, 6 Drawing Sheets 



42 

402 


Encoded 




| Data Block 1 


I Data Sock 2 j D»ta Black 3 


Data Block* 1 


I C1 I C2 I C3 


I CI 1 C2 I C3 1 CI 1 C2 1 C3 


CI I C2 | C3 1 


«* 


ft 




| CI MUX | 


| C2MUX | 


| C3MUX ] 




02/25/2004, EAST Version: 1.4.1 



US 6,304,995 Bl 

Page 2 



OTHER PUBLICATIONS 

D. Divsalar and F. Pollara, "Multiple Turbo Codes for 
Deep-Space Communications", Communications Systems 
and Research Section, TDA Progress Report 42-121, Jet 
Propulsion Lab, NASA, May 15, 1995.* 
D. Raphaeli anfd Y Zarai, "Combined Turbo Equalization 
and T\irbo Decoding*', IEEE, 1997.* 
V. Franz and J. Anderson, "Concatenated Decoding with a 
Reduced-Search BCJR Algorithm", IEEE 1998.* 
L. Papke and K. Fazel, "Combined Multilevel Turbo-code 
with MR-modulation", IEEE, 1995.* 
S. Benedetto, D. Divsalar, G. Montorsi, and E Pollara, "A 
Soft-Input Soft-Out-put APP Module for Iterative Decod- 
ing of Concatenated Codes", IEEE, 1997.* 
D. Divsalar and F. Pollara, "Hybrid Concatenated Codes and 
Iterative Decoding", Communications Systems and 
Research Section, TDA Progress Report 42-130, Jet Pro- 
pulsion Lab, NASA, Aug. 15, 1997 * 



Blazek Z et al: "A DSP-Based Implementation of a Tur- 
bo-Decoder" IEEE Global Telecommunications Confer- 
ence, US, New York, NY; IEEE, 1998, pp. 2751-2755, 
XP000801545 ISBN: 0-7803^985-7, p. 2751, right-hand 
col., line 6-p. 2752, left-hand col., line 10; figure 1, p. 2753^ 
paragraph 4-last paragraph. 

Sklar B; "Turbo Code Concepts Made Easy, or How I 
Learned to communicate and Reiterate" Annual Military 
Communications Conference, US, New York, NY: IEEE, 
1997, pp. 20-26, XP000799643 ISBN: 0-7803-^250-X. 

Benedetto, S., Divsalar, D., Montorsi, G., and Pollara F., 
"Continuous Map Algorithms and Their Applications to 
Decode Pareallel and Serial Code Concatenations", Pro-* 
ceedings of the Fifth European Space Agency Workshop on 
Digital Signal Processing Techniques Applied to Space 
Communications, Sep. 1996, pp. 8.10-S.24. 

* cited by examiner 



02/25/2004, EAST Version: 1.4.1 



U.S. Patent Oct. 16, 2001 Sheet 1 of 6 US 6,304,995 Bl 



100 



102 




104 

A 



108 



106 



Encoder 1 



/ 



Encoder 2 



120 



Puncturer 



116 



118 



1^ 
122 



► C1 Bits 

124 

C2 Bits 
— t> C3 Bits 
126 



Figure 1 




02/25/2004, EAST version: 1.4.1 



U.S. Patent Oct. 16, 2001 Sheet 2 of 6 US 6,304,995 Bl 



300 



(C1+C2) 



L 



302 J 04 308 



306 



c c 

SIS01 „ 
M M 



L 



316 



314 



310 



311 



(C3) 



L 



322 



A 



TT 



312 



L 



308 



c c 

SIS02 ta 
M M 



/ 



319 



Dday | -»[7 



\ 



Decoded 
Data 



324 



1 



326 



TT 



320 



Figure 3 



02/25/2004, EAST version: 1.4.1 



U.S. Patent Oct. 16, 2001 Sheet 3 of 6 



US 6,304,995 Bl 



400 



Encoded 
Data 



402 



1 



Data Block 1 | Data Block 2 [ Data Block 3 | Data Block 4 | 




448 



Decoded 
Data 



Figure 4 



02/25/2004, EAST Version: 1.4.1 



U.S. Patent Oct. 16, 2001 Sheet 4 of 6 US 6,304,995 Bl 



500 



600 





r 






502^ 


Acquire Access 
to Block of 
Data 




i 


r 


504^ 


Performing First 
Decoding Iteration 
on Block of Data 
with First Decoder 
Processor 






r 


506^ 


Send Metrics from 
First Decoding 
Iteration to 
Second Decoder 
Processor 


508 v^T^V 
\/ More X 

Yes y< ^ a * a * n NeeaSs 



Iteration?. 



51a 



\ 



No 



( Stop ) 



Figure 5 



( Start ) 
► 



602 



\ 



Receive Metrics 
from (n-1)th 
Cascaded 
Decoder 



604 



\ 



Acquire Access 
to Data 

Corresponding to 
Received Metrics 



606 



\ 



Perform nth 
Decoding Iteration 
on Block of Data 
with nth Decoder 
Processor 



608 



\ 



Send Metrics from 
nth Decoding 
Iteration to (n+1)th 
Decoder Processor 




612 iNo 
^ Stop ) 



Figure 6 



02/25/2004, EAST Version: 1.4.1 



U.S. Patent 



Oct. 16, 2001 



Sheet 5 of 6 



US 6,304,995 Bl 



700 



( Start ) 











702^ 


Receive Metrics 
from Previous 
Decoder 
Processor 



704 



\ 



I 



Acquire Access 
to Data 

Corresponding to 
Received Metrics 



706 



\ 



Perform Last 
Decoding Iteration 
on Block of Data 
with Last Decoder 
Processor 



708 



Send Metrics from 
^ Last Decoding 
Iteration to Data 
Decision Element 



7m 



Yes 



More 
Data to 
Decode? 



712. 



No 



\ Stop ) 
Figure 7 



02/25/2004, EAST version: 1.4.1 



U.S. Patent Oct. 16, 2001 Sheet 6 of 6 



US 6,304,995 BI 



800 



802. , s 

V Encoded A 
y Data Arrives J 



804 



\ 



Perform First 
Decoding Iteration on 
Encoded Data with 
First Cascaded 
Decoder Processor 



810 



\ 



808 



Perform nth 
Decoding Iterative 
with nth Decoder 
Processor 



\ 



Prepare for Next 
Decoding Iteration 




812 



\ 



Pass Metrics from 
Last Decoding 
Iteration to Data 
Decision Element 



814 



Generate Decoded 
\ Output Data with 
Data Decision 
Element 



816 



\ 



Free the Memory 
Block Containing 
Encoded Data Just 
Decoded 



Decoding Block < 
Encoded Data Complete, 



nplete/ 



Figure 8 



02/25/2004, EAST version: 1.4.1 



US 6,304 ; 

1 

PIPELINED ARCHITECTURE TO DECODE 
PARALLEL AND SERIAL CONCATENATED 
CODES 

BACKGROUND OF THE INVENTION 5 

The present invention generally relates to the decoding of 
parallel and serial concatenated codes. More specifically, the 
present invention relates to a pipelined architecture and 
method for iteratively decoding parallel and serial concat- 
enated codes in order to minimize the effects of decoding 10 
processing speed on overall data communication rate. 

Data signals, in particular those transmitted over a typi- 
cally hostile RF interface, are susceptible to errors caused by 
interference. Various methods of error correction coding J5 
have been developed in order to minimize the adverse effects 
that a hostile interface has on the integrity of communicated 
data. This is also referred to as lowering the Bit Error Rate 
(BER), which is generally defined as the ratio of incorrectly 
received information bits to the total number of received 2Q 
information bits. Error correction coding generally involves 
representing digital data in ways designed to be robust with 
respect to bit errors. Error correction coding enables a 
communication system to recover original data from a signal 
that has been corrupted. Typically, the greater the expected 25 
BER of a particular communication link, the greater the 
complexity of the error correction coding necessary to 
recover the original data. In general, the greater the com- 
plexity of the error correction coding, the greater the inef- 
ficiency of the data communication. The greater inefficiency 3Q 
results from a reduction of the ratio of information bits to 
total bits communicated as the complexity of the error 
correction coding increases. The additional information 
introduced into the original body of data by error correction 
coding consumes spectrum bandwidth and processor cycles 35 
on both the transmitting and receiving ends of the commu- 
nication. 

In cases where the expected BER of a particular commu- 
nication link is substantially higher than the acceptable 
BER, a concatenated set of error correcting codes may be 40 
applied to the data in order to lower the BER to acceptable 
levels. Concatenated error correction coding refers to 
sequences of coding in which at least two encoding steps are 
performed on a data stream. Concatenated coding may be 
performed in series, where encoded data is subjected to 45 
further encoding, or in parallel where the original data is 
subjected to different encoding schemes to perform inter- 
mediate codes which are then further processed and com- 
bined into a serial stream. 

For example, in serially concatenated coding, where two 50 
error correction codes are concatenated, an "outer code" is 
applied to the original data followed by an "inner code" 
which is then applied to the original data already encoded 
with the outer code. Serially concatenated coded data may 
become quite complex, even in an error correction scheme 55 
involving the application of only two concatenated error 
correction codes. An outer code may take the form of a block 
code, such as a Reed-Solomon code, and an inner code may 
take the form of a convolutional code. 

Reed-Solomon block codes are organized on the basis of 60 
groups of bits referred to as symbols. To form symbols, an 
incoming serial bit stream may be stored as sequences of m 
individual bits (a symbol). The Reed-Solomon code has k 
information symbols (rather than bits), r parity symbols, and 
a total codeword length of n«k+r symbols. For 8-bit 65 
symbols, a Reed-Solomon codeword is typically 255 sym- 
bols in length. Allowing the codeword to correct up to 16 
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symbols requires 32 parity symbols, thereby leaving 223 
data symbols (for an effective code rate of 223/255 
(approximately 7/8)). 

A convolutional code is a type of error correcting code 
which transforms an input sequence of bits into an output 
sequence of bits through the use of a finite-state machine, 
where additional bits are added to the data stream to allow 
for error-correcting capability. Typically the amount of 
error-correction capability is proportional to the amount of 
additional bits added and the amount of memory preset in 
the finite-state machine encoder. The constraint length, K, of 
a convolutional code is proportional to the finite-state 
machine's memory, and the rate of the convolutional code 
(e.g. m/n with m<n) describes how many additional bits are 
added for every m information bits input (i.e., n-m bits 
added for each m information bits). The decoding complex- 
ity of a convolutional code increases exponentially with the 
constraint length. 

Next consider an example of a parallel concatenated turbo 
coding scheme. A block of data may be encoded with a 
particular coding method resulting in systematic bits and 
parity bits. Additionally, the original block of data may be 
rearranged with a permuter and then encoded with the same 
method as that applied to the original data resulting in 
systematic bits (which may be discarded) and parity bits. 
The two sets of encoded data are then further processed and 
merged into a serial bit stream. As with the case of serially 
concatenated coding, the complexity of parallel concat- 
enated coding depends on the chosen encoding scheme, and 
can become significantly complex. 

From the previous discussion, it is apparent that data 
encoded with a convolutional error correction coding 
scheme may become quite complex, even with only two 
levels of convolutional encoding. The amount of processing 
necessary to decode such convolutionally encoded data can 
be considerable. 

Parallel and serial concatenated codes are sometimes 
decoded using iterative decoding algorithms. One com- 
monly employed method of iterative decoding utilizes a 
single decoder processor where the decoder output metrics 
are fed back to the input of the decoder processor. Decoding 
is performed in an iterative fashion until the desired number 
of iterations have been performed. In order for the decoder 
processor to decode the encoded input data at the same rate 
as the input data is arriving, the decoder processor must 
process the encoded data at a rate faster than the rate of the 
incoming data by a factor at least equal to the number of 
iterations necessary. With this method of iterative decoding, 
the speed of the decoder processor becomes a significantly 
limiting factor in the system design. 

Another method of iterative decoding utilizes a number of 
decoder processors equal to the number of processing itera- 
tions necessary, each decoder processor operating indepen- 
dently and in parallel with the others. Each decoder proces- 
sor iteratively decodes its own block of data from start to 
finish. The decoder processors take turns processing incom- 
ing blocks of data. For example, in a system with three 
independent decoders operating in parallel, decoder one may 
decode blocks n, n+3, n+6, etc., decoder two may decode 
blocks n+1, n+4, n+7, etc., and decoder three may decode 
blocks n+2, n+5, n+8, etc. Each decoder processor may 
either have multiple blocks of dedicated memory or a 
complex multiplexing/demultiplexing scheme allowing 
decoder processors to share memory. Each parallel decoder 
processor also has its own data decision element. The 
outputs from the parallel decoder processors are multiplexed 
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to form a serial bit stream. It is apparent from the above BRIEF DESCRIPTION OF THE DRAWINGS 

discussion that with the parallel decoder processor method nG j iUustrates the structure of , basic t^^ding 
of iterative decoding, the quantity or hardware necessary to 

implement the method becomes a significantly limiting *„ .„ , r . . : 

factor in the system design. 5 FIG - 2 illustrates the structure of a basic turbo-decoding 

The need to maximize processing speed and minimize svstem * 

hardware requirements exists in the communications indus- FIG ; 3 illustrates an iterative turbo-decoding system with 

try. For example, nowhere is this need more apparent than in a sin Sl e decoder Processor. 

satellite communications systems where relatively large FIG. 4 illustrates a pipelined iterative turbo-decoding 

amounts of data are to be processed by relatively small *° system according to a preferred embodiment of the present 

amounts of hardware. The data throughput rate in satellite invention. 

communications systems is constantly pushing the envelope FIG. 5 illustrates a method for iteratively decoding 

of processing speed. However, there is also a great need to encoded data performed by a first cascaded decoder proces- 

minimize the amount of payload hardware because of cost, sor according to a preferred embodiment of the present 

weight, power consumption, and reliability concerns. Thus, 15 invention. 

there exists a need in the communications industry for an pic. 6 illustrates a method for iteratively decoding 

improved method of decoding data that maximizes process- encoded data performed by a middle cascaded decoder 

ing speed while minimizing the amount of hardware processor according to a preferred embodiment of the 

required. present invention. 

SUMMARY OF THE INVENTION F * G * ^ illustrates a method for iteratively decoding 

encoded data performed by a last cascaded decoder proces- 

It is an object of the present invention to provide an ^ according to a preferred embodiment of the present 

improved scheme for iteratively decoding data encoded with invention. 

concatenated codes. ^ FIG. 8 illustrates a method for iteratively decoding a 

It is another object of the present invention to provide a block of encoded data according to a preferred embodiment 

pipelined scheme for decoding data encoded with concat- 0 f me present invention, 
enated codes utilizing cascaded decoder processors. 

It is yet another object of the present invention to provide DETAILED DESCRIPTION OF THE 

a pipelined scheme for decoding data encoded with concat- 30 PREFERRED EMBODIMENT 

enated codes utilizing cascaded decoder processors, each of To illustrate a preferred embodiment of the present 

which process an iteration or subset of the total number of invention, a data encoding technique known as turbo-coding 

iterations, for iteratively decoding the encoded data. was chosen. General information regarding turbo-coding 

A preferred embodiment of the present invention provides can be found in "Continuous Map Algorithms and Their 

an apparatus and method for iteratively decoding data. The 35 Applications to Decode Parallel and Serial Code 

apparatus includes pipelined and cascaded decoder Concatenations" , Proceedings of the Fifth European Space 

processors, each of which perform an iteration, or subset of Agency Workshop on Digital Signal Processing Techniques 

the total iterations, of an iterative decoding scheme. Also Applied to Space Communications, September 1996, pgs. 

included is a multiple block memory device which stores the 8.10-8.24, which is included herein by reference in its 

input encoded data while the data is being processed by the 40 entirety. FIG. 1 illustrates a basic turbo-coding system 100, 

decoder processors. The input encoded data is stored in Data to be encoded ("raw data") is applied to the input 102 

blocks so that individual input encoded data blocks may be of a first encoder 104. The first encoder 104 performs a 

selectively accessed by different decoder processors. Mul- chosen type of encoding on the raw data bits and outputs two 

tip lexers are implemented to provide the decoder processors sets of bits. The first set of bits output from the first encoder 

selective access to specific input encoded data blocks. A data 45 104 are commonly referred to as systematic bits, which are 

decision element is provided for generating decoded output output from the systematic bit output 106 of the first encoder 

data and is connected to the output of the last decoder 104. Systematic bits are directly representative of the infor- 

processor in the cascade of decoder processors. mation contained in the raw data. The systematic bits output 

The method for iteratively decoding data includes receiv- from the first encoder 104 will also be referred to as the CI 

ing blocks of encoded input data into available memory 50 bits. The second set of bits output from the first encoder 104 

blocks while cascaded decoder processors are simulta- are commonly referred to as parity bits, which are output 

neously performing their respective decoding iterations on from the parity bit output 108 of the first encoder 104. Parity 

encoded data previously received and stored in occupied bits are a function of the states of the raw data bits, 

data blocks. Upon completion of their respective decoding The raw data is also applied to the input 110 of a permute r 

iterations, the decoder processors provide output 55 112. A permuter is a device which rearranges bits, typically 

information, also known as metrics, to their respective in a random manner. The permuter 112 rearranges the raw 

successor decoder processors in the cascade. The decoder data bits. The rearranged raw data bits output from the 

processors then proceed to perform their respective itera- permuter 112 are then applied to the input 114 of a second 

tions on their next respective blocks of encoded input data encoder 116. The second encoder 116 may be the same type 

based on metrics that have been received from their respec- 60 of encoder as the first encoder 104. As with the first encoder 

tive predecessor decoder processors in the cascade. The 104, the second encoder 116 outputs two sets of bits! 

method further includes generating output decoded data with Systematic bits are output from the systematic bit output 118 

a data decision element based on metrics information of the second encoder 116, and parity bits are output from 

received from the last decoder processor in the cascade. the parity bit output 120 of the second encoder 116. The 

Once the encoded input data stored in a memory block has 65 systematic bits output from the second encoder 116 are 

been decoded, the memory block is made available to store typically discarded since they generally contain the same 

new incoming encoded data. information as the systematic bits output from the first 
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encoder 104. The parity bits, however, are retained since 
they represent unique and important information. 

The parity bits output from the first encoder 104 parity bit 
output 108 are input to a puncturcr 122. Likewise, the parity 
bits output from the second encoder 116 parity bit output 120 5 
are input to the puncturer 122. Apuncturer is a device which 
compresses data, thereby eliminating unnecessary informa- 
tion. The puncturer 122 compresses the input parity bits in 
order to make more efficient use of communication channel 
bandwidth. The puncturer 122 output bits corresponding to 10 
compressed parity bits from the first encoder 104 are output 
from the first puncturer output 124. The puncturer 122 
output bits corresponding to compressed parity bits from the 
first encoder 104 will also be referred to as "the C2 bits". 
The puncturer 122 output bits corresponding to compressed 15 
parity bits from the second encoder 116 are output from the 
second puncturer output 126. The puncturer 122 output bits 
corresponding to compressed parity bits from the second 
encoder 116 will also be referred to as "the C3 bits". Prior 
to being modulated and transmitted over a communication 20 
channel, the CI bits, C2 bits and C3 bits will be combined 
in some manner to form a serial bit stream. 

On the receiving end of the communication channel, the 
signal is demodulated and must be decoded to extract the 
original raw data. FIG. 2 illustrates a basic turbo decoder 25 
200 that will serve to develop basic turbo decoder concepts, 
an understanding of which will assist in understanding the 
subsequent discussion. 

The systematic bits (CI bits) and first set of parity bits (C2 3Q 
bits) of turbo-encoded data are applied to the data input 202 
of a first Soft-Input-Soft-Output ("SISO") decoder 204. The 
metrics input 206 of the first SISO decoder 204 has a null 
input applied. The data output 208 of the first SISO decoder 
204 is discarded. The metrics output 210 of the first SISO 35 
decoder 204 is applied to the input of a permuter 211. The 
permuter 211 rearranges the bits of the input metric and 
applies the result to the metrics input 212 of a second SISO 
decoder 214. The second set of parity bits (C3 bits) are 
applied to the data input 216 of the second SISO decoder 4Q 
214. The second SISO decoder 214 performs a decoding 
operation based on the C3 bits and the metric bits from the 
permuter 211, which are permuted metric bits, originally 
output from the first SISO decoder 204. The data output 218 
of the second SISO decoder 214 is discarded. The metrics 45 
output 219 of the second SISO decoder 214 is applied to the 
input of an inverse-permuter 220. An inverse-permuter is in 
essence a permuter set up to restore permuted bits to their 
pre-permuted order. For example, if bit sequence A is 
permuted by a permuter to form bit sequence B, an inverse 5Q 
permuter would restore bit sequence B back to bit sequence 
A. 

In addition to being applied to the input of the permuter 
211, the metrics output 210 of the first SISO decoder 204 is 
applied to a delay circuit 222. The output of the delay circuit 55 
222 and the output of the inverse-permuter 220 are applied 
to inputs of an adder 224. The output of the adder 224 is 
applied to the input of a data decision processor 226 which 
outputs a bit stream of decoded data corresponding to the 
original raw data. 60 

The example decoder illustrated in FIG. 2, and discussed 
above, often does not provide adequately decoded data in 
one pass. It is typically utilized in an iterative architecture 
300 as shown in FIG. 3. In an iterative type of decoding 
system, an output metric resulting from a decoding iteration 65 
will typically be fed back to the metrics input of a decoder. 
The encoded data will then be decoded repeatedly with 
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better and better metrics until the metrics converge to values 
corresponding to high confidence levels. The total number of 
decoding iterations necessary to achieve the desired confi- 
dence level is typically a predetermined value. 

Comparing the decoding schemes illustrated in FIGS. 2 
and 3, the only architectural difference is a metric feedback 
path from the output of the inverse-permuter (220 in FIG. 2; 
320 in FIG. 3) to the input of the first SISO decoder (206 in 
FIG. 2, 306 in FIG. 3). 

Referring to the decoder processor 300 illustrated in FIG. 
3, the systematic bits (CI bits) and first set of parity bits (C2 
bits) of turbo-encoded data are applied to the data input 302 
of a first Soft-Input-Soft-Output ("SISO") decoder 304. The 
metrics input 306 of the first SISO decoder 304 is connected 
to the output from the inverse-permuter 320. The data output 
308 of the first SISO decoder 304 is discarded. The metrics 
output 310 of the first SISO decoder 304 is applied to the 
input of a permuter 311. The permuter 311 rearranges the 
bits of the input metric and applies the result to the metrics 
input 312 of a second SISO decoder 314. The second set of 
parity bits (C3 bits) are applied to the data input 316 of the 
second SISO decoder 314. The second SISO decoder 314 
performs a decoding operation based on the C3 bits and the 
metric bits from the permuter 311, which are permuted 
metric bits from the first SISO decoder 304. The data output 
318 of the second SISO decoder 314 is discarded. The 
metrics output 319 of the second SISO decoder 314 is 
applied to the input of the inverse-permuter 320. 

In addition to being applied to the input of the permuter 
311, the metrics output 310 of the first SISO decoder 304 is 
applied to a delay circuit 322. The output of the delay circuit 
322 and the output of the inverse-permuter 320 are applied 
to inputs of an adder 324. The output of the adder 324 is 
applied to the input of a data decision processor 326 which 
outputs a bit stream of decoded data corresponding to the 
original raw data. 

As stated earlier, the output of the inverse-permuter 320 
is connected to the metrics input 306 of the first SISO 
decoder 304. Thus, the decoder processor 300 performs the 
nth decoding iteration with an input metric resulting from 
the (n-l)th decoding iteration. The total number of iterations 
is typically pre-determined. However, there may be 
instances where flexibility is designed into an architecture to 
enable an adaptable number of iterations to be performed. 

As mentioned in the background section, a weakness of 
the single decoder processor/multiple iteration decoding 
scheme is that the decoder processor 300 must complete the 
iterative decoding processing as fast as the encoded data is 
arriving. Thus for an n-iteration decoding scheme, the 
decoder processor 300 must process a decoding iteration 
n-times as fast as the data is arriving. A pipelined solution to 
this problem utilizing cascaded decoder processors is illus- 
trated in FIG. 4. 

FIG. 4 illustrates an embodiment 400 of the present 
invention applied to the particular problem of iteratively 
decoding data encoded with turbo coding. The particular 
embodiment 400 illustrated is for iteratively decoding data 
by performing three decoding iterations. The example may 
be extended to include as many decoding iterations and 
cascaded decoder processors as necessary. 

Encoded data may be stored in consecutive data blocks as 
the encoded data arrives. The multiple-block memory 402 of 
the embodiment illustrated in FIG. 4 is divided into four data 
blocks, Data Block 1, Data Block 2, Data Block 3 and Data 
Block 4. Each data block is logically and/or physically 
divided into three sections, CI, C2 and C3. CI sections store 
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systematic bits, C2 sections store first sets of parity bits, and reason for this is that the first decoder processor 406 is the 

C3 sections store second sets of parity bits. first decoder processor in the cascade, and therefore has no 

A multiplexing and data control block 404 controls proceeding decoder processor from which to ^receive metric 

decoder processor access to the encoded data in the memory f^T^nZ^ ,L ^a a^a 2rt 

402. In the embodiment illustrated in FIG. 4, the multiplex- 5 decoder, SISO-2.1 of the second decoder processor 430 

. ' , . ... . An . . , . , ' K receives metric information output from the inverse per- 

mg and data control block 404 is divided into three sections^ muter ^ of ^ ^ sQr m E for the 

The first section, CI MUX controls access to CI sets of metfic . difference just discussed , the second decoder 

systematic bits. The second section, C2 MUX, controls proce ssor 430 performs the second decoding iteration on the 

access to C2 first sets of parity bits. The third section, C3 encoded data m Data Block x m the same manner as 

MUX, controls access to C3 second sets of parity bits. w discussed previously re g ar ding the first decoder processor 

The embodiment illustrated in FIG. 4 shows three cas- 406. 

caded decoder processors (406, 430, and 436). The data While the first decoder processor 406 is performing its 

input 408 of the first decoder SISO-1.1 of the first decoder respective decoding iteration on the encoded data in Data 

processor 406 initially receives data from the CI and C2 Block 2 and the second decoder processor 430 is performing 

sections of Data Block 1, via CI MUX and C2 MUX ils respective decoding iteration on the encoded data in Data 

respectively. The metric input 410 of SISO-1.1 receives a Block ^ new]y arrrv ing encoded data is stored in Data Block 

null input since the first decoder processor 406 is the first 3 when tne first decoder processor 406 completes the first 

decoder processor in the cascade of decoder processors. deC oding iteration on the encoded data in Data Block 2, and 

SISO-1.1 performs a decoding operation and produces two me seC0Qd decoder processor 430 completes the second 

outputs, a data output 412 and a metric output 414. The data decoding iteration on the encoded data in Data Block 1, the 

output 412 from SISO-1.1 is discarded. The metric output decoder processing of a next iteration begins. The output 

414 from SISO-1.1 is input to the permuter 416 of the first me trics from the inverse-permuter 426 of the first decoder 

decoder processor 406. The permuter 416 rearranges its processor 406 are applied to the metrics input 428 of the first 

input bits and sends them to the metric input 418 of the decoder, SISO-2.1, of the second decoder processor 430. 

second decoder SISO-1.2 of the first decoder processor 406. output me trics from the inverse-permuter 432 of the 

Trie data input 420 of SISO-1.2 initially receives data second decoder processor 430 are applied to the metrics 

from the C3 section of Data Block 1 via C3 MUX. SISO-1.2 input 434 of the first decoder, SISO-3.1, of the third decoder 

performs a decoding operation and produces two outputs, a processor 436. 

data output 422 and a metric output 424. The data output 422 3Q The multiplexing and data control block 404 now grants 

from SISO-1.2 is discarded. The metric output 424 of me third decoder processor 436 access to Data Block 1, 

SISO-1.2 is input to the inverse-permuter 426 of the first grants the second decoder processor 430 access to Data 

decoder processor 406. The inverse-permuter 426 rearranges Block 2, and grants the first decoder processor 406 access to 

its input bits, which are the metric bits output from SISO- a new block of data, Data Block 3. As the third decoder 

1.2, in a manner that is inverse to the bit rearrangement 35 processor 436 performs the third decoding iteration to be 

performed by the permuter 416 and sends the rearranged performed on the encoded data in Data Block 1, the second 

input metric bits to the metric input 428 of SISO-2.1 of the decoder processor 430 performs the second decoding itera- 

second decoder processor 430. t ion to be performed on the encoded data in Data Block 2, 

While the first decoder processor 406 is performing its and the first decoder processor 406 performs the first decod- 

decoding iteration on the encoded data stored in Data Block 40 ing iteration to be performed on the encoded data in Data 

1, newly arriving encoded data is stored in Data Block 2. Block 3. The decoding iteration performed by the first 

When the first decoder processor 406 completes its decoding decoder processor 406 on the encoded data in Data Block 3 

iteration of the encoded data in Data Block 1, resulting in the follows the discussion above for the decoding iteration 

sending of the metrics from the inverse-permuter 426 of the performed by the first decoder processor 406 on Data Block 

first decoder processor 406 to the metrics input 428 of 45 1 . The decoding iteration performed by the second decoder 

SISO-2.1 of the second decoder processor 430, the decoder processor 430 on the encoded data in Data Block 2 follows 

processing of a next iteration begins. The multiplexing and the discussion above for the decoding iteration performed by 

data control block 404 now grants the second decoder the second decoder processor 430 on Data Block 1. 

processor 430 access to Data Block 1 and grants the first The third decoder processor 436 performs the third decod- 

decoder processor 406 access to a new block of data, Data 50 ing iteration to be performed on the encoded data in Data 

Block 2. As the second decoder processor 430 performs the Block 1 in a manner very similar to the second decoding 

second decoding iteration to be performed on the encoded iteration performed by the second decoder processor 430 on 

data in Data Block 1, the first decoder processor 406 Data Block 1. An important difference is that the metric 

performs the first decoding iteration to be performed on the input 428 of the first decoder, SISO-2.1, of the second 

encoded data in Data Block 2. The decoding iteration 55 decoder processor 430 receives the output metric from the 

performed by the first decoder processor 406 on the encoded inverse-permuter 426 of the first decoder processor 406, and 

data in Data Block 2 follows the discussion above regarding the metric input 434 of the first decoder, SISO-3.1, of the 

the decoding iteration performed by the first decoder pro- third decoder processor 436 receives the output metric from 

cessor 406 on Data Block 1. the inverse permuter 432 of the second decoder processor 

The second decoder processor 430 performs the second 60 430. Except for the metric input difference just discussed; 

decoding iteration to be performed on the encoded data in the third decoder processor 436 performs the third decoding 

Data Block 1 in a manner similar to the first decoding iteration on the encoded data in Data Block 1 in the same 

iteration performed by the first decoder processor 406. An manner as discussed previously regarding the second 

important difference between the first decoder processor 406 decoder processor 430. 

and the second decoder processor 430 is that the metric input 65 As the third decoder processor 436 performs the third 

410 of the first decoder, SISO-1.1, of the first decoder decoding iteration on Data Block 1, the metric output data 

processor 406 received a null input. As discussed above, the from SISO-3.1 and the metric output data from SIS 0-3. 2 
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(through inverse-permuter 442) is synchronized, added, and cascade. The nth decoder processor receives metrics 602 
analyzed by a data decision element 437 to produce the final from the (n-l)th decoder processor in the cascade. The nth 
set of decoded data corresponding to the encoded data in decoder processor then acquires access 604 to the data block 
Data Block 1. The metric output 438 from SISO-3.1 is corresponding to the received metrics. The nth decoder 
connected to a delay element 444. The metric output 440 5 processor in the cascade then performs the nth decoding 
from SISO-3.2 is connected to the inverse-permuter 442 of iteration 606 on the block of data. The metrics resulting from 
the third decoder processor 436. The delay element 444 the nth decoding iteration are then sent 608 to the (n+l)th 
output 446 and the inverse-permuter 442 output 448 are decoder processor in the cascade. A decision 610 is then 
connected to an adder 450. The adder 450 output 452 is made. If there are more blocks of encoded data to be 
connected to the input of a decision processor 454. The Q decoded with the nth decoding iteration, the process begins 
output of the decision processor 454 is the decoded data again at the metrics receiving step 602. If there are no more 
corresponding to the encoded data in Data Block 1. Now that blocks of encoded data to be decoded with the nth decoding 
the encoded data in Data Block 1 has been decoded, the iteration, the processing of nth decoding iterations is corn- 
memory space in Data Block 1 is made available for storing plete 612. 

the next block of encoded data received. 15 FIG. 7 illustrates the method 700 followed by the last 
While the first decoder processor 406 is performing its decoder processor in the cascade. The last decoder processor 
respective decoding iteration on the encoded data in Data receives metrics 702 from the next-to-last decoder processor 
Block 3, and the second decoder processor 430 is perform- in the cascade. The last decoder processor then acquires 
ing its respective decoding iteration on the encoded data in access 704 to the data block corresponding to the received 
Data Block 2, and the third decoder processor 436 is 20 metrics. The last decoder processor in the cascade then 
performing its respective decoding iteration on the encoded performs the last decoding iteration 706 on the block of data, 
data in Data Block 1, newly arriving encoded data is stored The metrics resulting from the last decoding iteration are 
in Data Block 4. When the first decoder processor 406 then sent 708 to a data decision element. A decision 710 is 
completes the first decoding iteration on the encoded data in then made. If there are more blocks of encoded data to be 
Data Block 3, and the second decoder processor 430 com- 25 decoded with the last decoding iteration, the process begins 
pletes the second decoding iteration on the encoded data in again at the metrics receiving step 702. If there are no more 
Data Block 2, and the third decoder processor 436 completes blocks of encoded data to be decoded with the last decoding 
the third decoding iteration on the encoded data in Data iteration, the processing of last decoding iterations is corn- 
Block 1, the decoder processing of a next iteration begins. plete 712. 

The multiplexing and data control block 404 now grants 30 The processing performed by the first cascaded decoder 

the third decoder processor 436 access to Data Block 2, processor, nth cascaded decoder processor, and last cascaded 

grants the second decoder processor 430 access to Data decoder processor were discussed separately. However, it 

Block 3, and grants the first decoder processor 406 access to should be understood that although the decoding iterations 

a new block of data, Data Block 4. As the third decoder on a given block of data are performed sequentially, decod- 

processor 436 performs the third decoding iteration to be 35 ing iterations are performed on different blocks of data 

performed on the encoded data in Data Block 2, the second simultaneously. For example, while a first block of data is 

decoder processor 430 performs the second decoding itera- being decoded with a third decoding iteration, a second 

tion to be performed on the encoded data in Data Block 3, block of data is being decoded with a second decoding 

and the first decoder processor 406 performs the first decod- iteration, and a third block of data is being decoded with a 

ing iteration to be performed on the encoded data in Data 40 first iteration. 

Block 4. The decoding iterations and final data decision The metrics resulting from the last decoding iteration that 

processing follow the previous discussion. While this next are sent 708 to the data decision element are analyzed by the 

round of decoding iterations are performed on the encoded data decision element to produce the final output decoded 

data in Data Block 2, Data Block 3 and Data Block 4, newly data. The analysis performed by the data decision element 

arriving encoded data is stored in Data Block 1. The iterative 45 may consist of a parity test, where the arithmetic sign of a 

decoding cycle discussed above continues until all of the metric indicates the bit value of the corresponding decoded 

encoded data received is decoded. output bit. The output of the data decision element is 

FIGS. 5, 6, 7 and 8 illustrate a method according to a preferably a bit stream of decoded data, 

preferred embodiment of the present invention for iteratively FIGS. 5, 6 and 7 illustrate a method for iteratively 

decoding encoded data using cascaded decoder processors. 50 decoding encoded data using cascaded decoder processors 

FIG. 5 illustrates the method 500 followed by the first from the perspective of the cascaded decoder processors, 

decoder processor in the cascade. As encoded data arrives, FIG. 8 illustrates the method 800 for iteratively decoding 

the first decoder processor in the cascade acquires access encoded data using cascaded decoder processors from the 

502 to the memory block in which the encoded data is perspective of a block of encoded data. The arrival of a block 

stored. The first decoder processor in the cascade then 55 of encoded data 802 initiates the decoding process, A first 

performs the first decoding iteration 504 on the block of decoding iteration 804 is performed on the block of encoded 

data. The metrics resulting from the first decoding iteration data using a first decoder processor. A done decision 806 is 

are then sent 506 to the second decoder processor in the then made based on the number of decoding iterations 

cascade. A decision 508 is then made. If there are more performed so far on the block of encoded data relative to the 

blocks of encoded data to be decoded with the first decoding 60 predetermined total number of decoding iterations that are to 

iteration, the process begins again at the data access acqui- be performed. If there are remaining decoding iterations to 

sition step 502. If there are no more blocks of encoded data be performed on the block of encoded data, preparations 808 

to be decoded with the first decoding iteration, the process- are made for the next decoding iteration to be performed; 

ing of first decoding iterations is complete 510. The preparations consist of providing the next cascaded 

FIG. 6 illustrates the method 600 followed by the nth 65 decoder processor performing the next decoding iteration 

decoder processor in the cascade, where the nth decoder with the output metrics from the most recent decoding 

processor is not the first or last decoder processor in the iteration and with access to the encoded data block being 
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decoded. The next decoding iteration 810 is then performed 
on the block of encoded data. The done decision 806 is then 
made again. The preparation 808 and decoding iteration 810 
steps are performed repeatedly until the done decision 806 
determines that the predetermined total number of decoding 
iterations to be performed on the encoded data block have 
been performed. Once this decision is made, the metrics 
from the last decoding iteration are passed 812 to a data 
decision element. Decoded output data is then generated 814 
by the data decision element based on the metrics passed to 
it. Since the encoded data is no longer needed after it is 
decoded, the memory block containing the encoded data is 
made available for other data 816. Although the decoding 
iteration step 810, last metric passing step 812, and output 
generating step 814 are illustrated sequentially in FIG. 8 for 
a block of encoded data, the metric passing step 812 and 
output generating step 814 preferably occur simultaneously 
with the last decoding iteration. This simultaneous process- 
ing is possible, because the metrics output from the last 
decoding iteration are preferably generated in the form of a 
sequential stream of metrics. 

While particular elements, embodiments and applications 
of the present invention have been shown and described, it 
will be understood, of course, that the invention is not 
limited thereto since modifications may be made by those 
skilled in the art, particularly in light of the foregoing 
teachings. It is therefore contemplated by the appended 
claims to cover such modifications as incorporate those 
features which come within the spirit and scope of the 
invention. 

What is claimed is: 

1. A cascaded processor decoder for decoding encoded 
data, the decoder comprising: 

a plurality of decoder processors, including a first cas- 
caded decoder processor and a second cascaded 
decoder processor; 

a metric output of said first cascaded decoder connected 
to a metric input of said second cascaded decoder 
processor; and 

a multiple -block memory connected to said decoder pro- 
cessors for storing encoded input data arranged in 
multiple data blocks. 

2. The cascaded processor decoder of claim 1, further 
comprising a plurality of multiplexers connected between 
said decoder processors and said memory, said multiplexers 
allowing any decoder processor to selectively access any 
data block in said memory. 

3. The cascaded processor decoder of claim 1, further 
comprising a data decision element for generating decoded 
output data, said data decision element connected to said 
second cascaded decoder processor. 

4. The cascaded processor decoder of claim 1, further 
comprising: 

a plurality of multiplexers connected between said 
decoder processors and said memory, said multiplexers 
allowing any decoder processor to access any data 
block in said memory; and 

a data decision element for generating decoded output 
data, said data decision element connected to said 
second cascaded decoder processor. 

5. The cascaded processor decoder of claim 1, further 
comprising: 

at least one multiplexer randomly accessing a data block 
of said multiple-block memory. 

6. A cascaded processor decoder for iteratively decoding 
data, the decoder comprising: 
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a plurality of decoder processors equal in number to a 
predetermined number of decoding iterations, said 
decoder processors including at least a first cascaded 
decoder processor and a second cascaded decoder 
5 processor; 

a metric output of said first cascaded decoder processor 
connected to a metric input of said second cascaded 
decoder processor; 

a multiple-block memory comprising data blocks at least 
10 equal in number to said decoder processors; 

a plurality of multiplexers connected between said data 
blocks and said decoder processors, said multiplexers 
allowing said decoder processors to selectively access 
1S any of said data blocks; and 

a data decision element for generating decoded output 
data, said data decision element connected to said 
second cascaded decoder processor. 

7. The cascaded processor decoder of claim 6 wherein 
20 each of said data blocks comprise: 

a systematic bits section; 
a first parity bits section; and 
a second parity bits section. 

8. The cascaded processor decoder of claim 7 wherein 
25 each of said decoder processors comprise: 

a first decoder connected to a first data block including an 
associated systematic bits section and an associated 
first parity bits section; 
30 a permuter connected to a metric output of said first 
decoder; 

a second decoder connected to said first data block 
including an associated second parity bits section, said 
second decoder connected to an output of said per- 
35 muter; and 

an inverse permuter connected to a metric output of said 
second decoder. 

9. The cascaded processor decoder of claim 8, wherein 
said data decision element comprises: 

40 a delay element connected to a metric output of said first 
decoder of said second cascaded decoder processor; 
an adder connected to an output of said delay element, 
said adder connected to an output of said inverse 
permuter of said second cascaded decoder processor; 

45 a data decision processor connected to an output of said 
adder, said data decision processor including a decoded 
data bit output. 

10. A method for iteratively decoding encoded data using 
a predetermined total number of decoding iterations, the 

50 method comprising: 

receiving encoded data into a data block in a multiple - 

block memory; 
accessing the encoded data in the data block by a first 
55 decoder processor through a plurality of multiplexers; 
and 

iteratively decoding said encoded data a predetermined 
total number of times with said cascaded decoder 
processors. 

60 11. The method for iteratively decoding encoded data of 
claim 10, further comprising: 

generating an output bit stream of decoded encoded data 

with a data decision element. 
12. The method for iteratively decoding encoded data of 
65 claim 10, wherein said step of iteratively decoding com-^ 
prises iteratively decoding among at least two intercon- 
nected cascaded decoder processors. 
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13. The method for iteratively decoding encoded data of 
claim 10, wherein said step of iteratively decoding com- 
prises decoding at most one iteration on each of said 
cascaded decoder processors. 

14. The method for iteratively decoding encoded data of 5 
claim 10, wherein said step of iteratively decoding com- 
prises: 

performing a first number of decoding iterations on said 
encoded data with a first decoder processor, and gen 
erating first resultant metrics; 

passing said first resultant metrics to a second decoder 
processor; 

accessing the encoded data in the data block by the second 
decoder processor through a plurality of multiplexers; 15 

performing a second number of decoding iterations on 
said encoded data and first resultant metrics with a 
second decoder processor and generating second result- 
ant metrics; and 

passing said second resultant metrics to a third decoder 20 
processor. 

15. The method for iteratively decoding encoded data of 
claim 14, further comprising: 
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performing a last decoding iteration with a last decoder 
processor, and generating last resultant metrics; 

passing said last resultant metrics from said last decoder 
processor to a data decision element; and 

performing data decisions with said data decision ele- 
ment. 

16. The method for iteratively decoding encoded data of 
claim 15 wherein said step of performing data decisions 
comprises generating decoded data based on a polarity of 
said last resultant metrics. 

17. The method for iteratively decoding encoded data of 
claim 15, wherein said step of generating last resultant 
metrics comprises: 

generating a first subset; 
generating a second subset; and 

generating said last resultant metrics by adding said first 
subset to said second subset. 

18. The method for iteratively decoding encoded data of 
claim 17 wherein said step of performing data decisions 
comprises generating decoded data based on a polarity of 
said last resultant metrics. 
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